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ABSTRACT: 

When a flow graph is created for a program including a 
complex if statement, 

the respective branch conditions in the complex if 
statement are separated and 

the flow graph designating the control flow corresponding 
to the logical value 

of respective branch conditions are created. Next, to 
perform the branch 

probability determining/allocating process, in the above 
flow graph, the value 

corresponding to the number of executions of respective 
paths is calculated so 

that the whole branch probability of the complex if 
statement designated by the 

optimization designating statement and the branch 
probability of respective 

branch conditions is assigned to the path corresponding to 
respective branch 
conditions • 

13 Claims, 19 Drawing figures 
Exemplary Claim Number: 10 
Number of Drawing Sheets: 11 
BRIEF SUMMARY: 

(1) BACKGROUND OF THE INVENTION 

(2) 1. Field of the Invention 

(3) The present invention relates to a method of 
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performing a compilation 

process to optimize execution speed of a program, by 
determining the branch 

probability of a branch statement in a program which 
includes a plurality of 

branch conditions, and further to an apparatus for 
performing the compilation 
process . 

(4) 2. Description of the Related Art 

(5) Increasing the speed of a process of a computer 
system, is being sought a 

compiler for translating a program at a high execution 
efficiency has been 

demanded. Such a function of the compiler is called 
optimization. 

(6) A VLIW (very long instruction word) computer capable 
of executing a 

plurality of instructions in parallel by merely designating 
a single long 

instruction word at every instruction execution cycle has 
been recently noted. 

(7) When a program to be executed by the VLIW computer is 
compiled, a 

plurality of instructions able to be executed in parallel, 
because they do not 

share an arithmetic operating unit or data path and do not 
require any results 

of operations performed by the other instructions, may be 
designated by a 

single long instruction word. As a result, optimization is 
achieved. By 

performing such optimization, the rate of executing 
respective instructions of 

a program to be performed in parallel can be increased and 

the execution time 

of a program can be shortened. 

(8) The above technical background requires technology 
for performing the 

compilation, and detecting, from a program to be processed 
a plurality of 

instructions, able to be executed in parallel, because they 
do not share an 

arithmetic operating unit or a data path and do not require 
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any results of 

operations by the other instructions. 

(9) Generally speaking, when a branch instruction exists 
in the program which 

is compiled, an instruction succeeding the branch 
instruction is not always 

executed depending on the branch condition. Therefore, it 
is desired quite 

often to d etect a plurality _ of,, ins truct_ions__executable i n 
parallel wrt h a group 

of instructions, allocated between two branch instructions 
called a basig 

felosk^— However , an optimization performed for respective 
basic blocks, merely 

enables, at maximum, two instructions to be executed as a 
result of a 
statistical average. 

(10) Effective methods for solving such a bottleneck have 
been proposed. 

(see, e.g., a method described as trace scheduling by J. R. 
Ellis, Bulldog: A 

Compiler for VLIW Architecture, the MIT Press, 1986) . 

f(ll) According to the txace scheduling method, a train of 
instructions — — — — _— — 

(called trace) , which have many execution times, is 
detected in a program the 

instructions in the trace are sorted and parallelized to 
enable the trace to be 

executed with high efficiency. In this case, when a loop 
instruction appears 

the number of repetitions of the loop is determined, 
similarly, when a branch 

instruction appears, the probability of each branch is 
obtained. Based on the 

number of repetitions in the loop and the branch 
probability, a path with a 

large number of executions is selected and a particular 
trace is selected. 

Thus, a pi n ral i i-y of instructions can be executed in 
paraJJ eJ — hy_osiag— units of 

the trace which overrid es a plurality of basic blocks an d 
has- _a larg er numb e r 

of expeditions. Consequently, the ratio of instructions 
executable in parallel 

can be increased and the execution speed of the program can 
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be improved. 



(12) Therefore, it is very important to obtain the number 
of repetitions of a 

loop instruction and a branch probability of the branch 
instruction . 

(13) Additionally, it is useful to obtain the probability 
of the branch 

instruction. 

(14) Namely, for example, the branch probability of each 
branch instruction 

is necessary to detect a path with a high execution 
frequency when an 

optimization is performed, to enable the resources, such as 
a register, to be 

assigned with a high priority to a path having a higher 

execution frequency 

when performing the compilation. 

(15) When a branch instruction, occurs the execution cost 
becomes higher than 

if it were not chosen. The execution time becomes longer, 
as compared to the 

case of non-occurrence of the branch, because an 
instruction cycle for setting 

a branch target address in a program counter is added when 
a branch occurs . 

(16) Therefore, if the probability of the occurrence of 
the branch at the 

branch instruction greatly exceeds 5 0 percent, the logic of 
the branch 

condition is reversed and the instruction train to be 
executed, in the case of 

the occurrence of the branch, is replaced by the 
instruction train to be 

executed when the branch does not occur to shorten the 
execution time 

statistically. When the necessity of the above 
replacement, is to be 

determined the branch probability of the branch instruction 

may also be 

required. 

(17) Further, in order to shorten the time required for 
the compilation to 
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perform an optimization, the level of optimization may be 
lowered for a path 

having a low frequency of execution. In such a case, a 
branch probability of 

the branch instruction may be required to determine the 
execution frequency of 

respective paths. For conventional optimization in a 
compilation process, the 

number of repetitions of the loop instructions is 
frequently shown in a source 

program and can be obtained with relative ease. 

(18) In contrast, the branch probability of the branch 
instruction can be 

determined for each respective branch instructions by the 
branch instruction 

being designated in the source program by a programmer or 
by using an automatic 

profile function resident in the computer system. The 
automatic profile 

function can measure the execution ratio of an instruction 
by compiling the 

program without performing the optimization. 

(19) When the branch instruction includes only a branch 
condition, the 

execution frequency of a path corresponding to the branch 
instruction can be 

determined immediately after the branch probability is 
determined. 

(20) In contrast, the prior art of a compilation process 
accompanied by an 

optimization when the branch instruction includes a 
plurality of branch 

conditions will be explained by the following FIGS. 1A 
through 1C . 

(21) The program 101 shown in FIG. 1A is written in a C 
language. When one 

of the branch conditions A, B or C is true, a basic block 
xx in a THEN clause 

is executed (the word "THEN" is implicit) , and if one of 
the branch conditions 

is not true, a basic block yy in an ELSE clause is executed 
and the basic block 

zz is executed after the completion of either of these 
clauses. A method of 
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performing a compilation process in the following two cases 
(FIGS. IB and 1C) 

has been known with regard to an IF statement, (which is 
called a complex IF 

statement) including a plurality of branch conditions in a 
statement 

representing one branch, as shown in this example. 

(22) As shown in FIG. IB, in a first compilation process, 
the plurality of 

branch conditions A, B and C, are calculated as a group. 
This first 

compilation process is advantageous in that the branch 
probability of a complex 

IF statement can be determined with ease since the branch 
decision is performed 

at a single place. A document corresponding to this prior 
art, is known as "A 

Method of Selecting Instructions Provided with Conditions 
upon Executing the 

Instructions" disclosed in Japanese Patent Publication 
Tokkohei 1-42019. 

(23) However, in the prior art shown in FIG. IB, the 
branch probabilities 

cannot be obtained for the respective branch conditions A, 
B and C. In the case 

of a basic block in which respective branch conditions A, B 
and C comprise a 

plurality of instructions, the first compilation process 
cannot achieve a 

precise optimization by selecting an optimum path from 
paths of the branches 

depending on respective branch conditions. 

(24) As shown in FIG. 1C, in the second compilation 
process, a decision 

involving a plurality of branch conditions is resolved into 
respective branch 

paths. Also, in the second compilation process, branch 
decisions are not 

performed at one place but are performed for respective 
branch conditions . 

Therefore, for example, when the branch condition A is 
true, the basic block xx 

in the THEN clause is immediately executed and the 
determination of the other 

branch conditions is not executed. Consequently, with 



10/03/2003, EAST Version: 1.04,0000 



regard to the same 

source program, an execution time of an executable program 
generated based on 

the compilation process shown in FIG. 1C is statistically- 
shorter than that of 

the executable program generated based on the compilation 

process shown in FIG. 

IB. 

(25) In the operations generated based on the compilation 
process shown in 

FIG. 1C, a branch decision is performed for individual 
branch conditions in a 

complex IF statement. Thus it is possible to perform a 
precise optimization by 

selecting an optimum path among the paths which are 
branched under the 
individual branch conditions. 

(2 6) Accordingly, it is important to know how to 
determine respective branch 

probabilities of the individual branch conditions based on 
the branch 

probability allocated for the whole complex if statement. 
However, the method 

of enabling determination of such branch probabilities is 
not known. 

(27) SUMMARY OF THE INVENTION 

(28) An object of the present invention residing in the 
above described 

background, is to enable the branch probabilities of the 
respective branch 

conditions to be allocated based on the branch probability 
allocated for the 

whole complex IF statement and to predict the frequency of 
the execution with 

high precision so that an executable program with high 
efficiency is created. 

(29) The present invention creates a flow graph, for the 
branch statement 

representing one branch including a plurality of branch 
conditions. This flow 

graph comprises nodes, representing basic blocks, including 
the respective 

branch conditions in the branch statement. The flow graph 
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further comprises a 

node representing a basic block to be executed where the 
conditional decision 

results in the branch statement being true and a node 
representing a basic 

block to be executed where the conditional decision results 
in the branch 

statement being false. This flow graph can be created as 
block connection 

relation data (corresponding to the connection matrix 
table) having the 

connection relation between respective basic blocks as the 
elements . 

(3 0) The branch probability of the path between 
respective nodes can be 

calculated as the branch probability of the .branch 
condition included with the 

basic block corresponding to the node from which the path 
is outputted, based 

on the number of paths between respective nodes, to satisfy 
the branch 

probability information designated for the whole branch 
statement . 

(31) According to the construction of the above present 
invention, it is 

possible to decrease the optimization level of the path to 
which the branch 

condition specifies branching to, when the path is 
determined to have a low 

execution frequency based on the branch probability. 

(32) According to the construction of the present 
invention, it is possible 

to allocate the branch probability information for each 
respective branch 

conditions in one branch statement, and to predict a 
frequency of the execution 

each respective instructions in the program with high 
precision when performing 

the compilation operation. The present invention may be 
applied to performing 

scheduling based on the frequency of execution of the 
respective instructions, 

such as the trace scheduling method. This thereby enables 
the compilation 

process, for translating the program, to be performed and 
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the program to be 

operated with high efficiency. 

DRAWING DESCRIPTION: 

BRIEF DESCRIPTION OF THE DRAWINGS 

The other objects and features of the present invention 
can be easily 

understood by persons skilled in the art based on the 
description relating to 

the preferred embodiments of the present invention together 
with the attached 
drawings . 

FIG. 1A shows an explanatory view of the prior art; 

FIG. IB shows an explanatory view of the prior art; 

FIG. 1C shows an explanatory view of the prior art; 

FIG. 2 shows an explanatory view of principle of the 
present invention; 

FIG. 3A shows an explanatory view of a second principle 
of the present 
invention; 

FIG. 3B shows a second explanatory view of a second 
principle of the present 
invention; 

FIG. 3C shows a third explanatory view of a second 
principle of the present 
invention; 

FIG. 4 shows a structural view of the preferred 
embodiment of the present 
invention; 

FIG. 5 shows an operational flowchart in the information 
collecting phase in 

the preferred embodiment of the present invention; 

FIG. 6A shows one part of an operational flowchart in a 
connection matrix 

creating phase of the preferred embodiment of the present 
invention; 
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FIG. 6B shows the further part of the operation 
flowchart in a connection 

matrix creating phase of the preferred embodiment of the 
present invention; 

FIG, 7 shows an operational flowchart in the connection 
matrix calculating 

phase of the preferred embodiment of the present invention 

FIG, 8A shows an explanatory view of the data structure 
of the basic block; 

FIG. 8B shows an explanatory view of a dummy block; 

FIG. 9A shows one view of the example of the connection 
matrix; 

FIG. 9B shows a second view of the example of the 
connection matrix; 

FIG. 9C shows a third view of the example of the 
connection matrix; 

FIG. 9D shows a fourth view of the example of the 
connection matrix; and 

FIG. 10 shows an explanatory view of an operation of 
another embodiment of 
the present invention. 

DETAILED DESCRIPTION: 

(1) DESCRIPTION OF THE PREFERRED EMBODIMENT 

(2) Explanation of the Principle 

(3) The principle of the present invention will be 
explained. 

(4) In FIG. 2, program 201 is an example of a program 
subject to a 

compilation process. In program 201, statement 1 
designates an optimization 

designating statement, statement 2 shows a complex IF 
statement, and xx in 

statement 3 represents a train of instruction statements 
which should be 
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executed when the branch condition of the complex IF 
statement is true known as 

a THEN clause, (The word then THEN is implicit in the 
complex IF statement,) 

yy in statement 4 represents a train of instruction 
statements which should be 

executed when the branch condition of the complex IF 
statement is false, knows 
as an ELSE clause. 

(5) The branch conditions A, B, C, D and E, xx in the 
THEN clause and yy in 

the ELSE clause respectively comprise a continuous train of 
instructions which 

are stopped or branched therefore, this is called a basic 
block . 

(6) In the program 2 01, the optimization designating 
statement of statement 1 

designates that the branch probability of the complex IF 
statement designated 

by statement 2, specifically, the truth probability in 
which the THEN clause 

designated by statement 3 (or xx) is executed, is 40 
percent . 

(7) In flow graph creating process 202, designated by the 
following processes 

1 and 2, with regard to program 201, a plurality of branch 
conditions A, B, C, 

D and E, in the complex IF statement designated by the 
statement 2, are not 

treated as a group. Instead individual branch conditions 
are separated, 

thereby creating a flow graph 2 04a which designates a 
control flow 

corresponding to the logical value of the respective branch 
condition. 

(8) Process 1: The complex IF statement designated by 
statement 2 is resolved 

into individual branch conditions. The THEN clause, ELSE 
clause and individual 

branch conditions are allocated with respective marks. 

(9) Process 2: Flow graph 2 04a is created for the branch 
conditions each 

allocated with the marks. 
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(10) The flow graph 2 04a created by the above recited 
processes 1 and 2 is 

comprised of intermediate nodes A, B, C, D and E, 
respectively corresponding to 

branch conditions A, B, C, D and E; a final node xx, 
corresponding to the THEN 

clause xx; a final node yy corresponding to the ELSE clause 
yy; and paths 

connected between the respective nodes. 

(11) In the branch probability determining/allocating 
process 203, designated 

by the following processes 3 to 8, the values corresponding 
to the numbers of 

executions by respective paths are calculated. 
Consequently the whole branch 

probability (true ratio equals 40 percent) of the complex 
IF statement of 

statement 2 designated by the optimization designating 
statement of statement 1 

is satisfied by the flow graph 2 04a. Branch probabilities 
of respective branch 

conditions are then assigned to the nodes corresponding to 
the respective 
branch conditions. 

(12) Process 3: The number of executions is allocated for 
final nodes xx and 

yy based on the designation by the optimization designating 
statement of 

statement 1. In the example shown in FIG. 2, the number of 
executions at node 

xx is allocated as 40 and the number of executions at node 

yy is allocated as 

60. 

(13) Process 4: The respective numbers of paths input to 
respective nodes xx 

and yy are searched. In the example of FIG. 2, two paths 

are input to node xx 

and three paths to node yy. 

(14) Process 5: The numbers of executions at the 
respective nodes xx and yy 

are divided by the respective numbers of paths inputted to 
respective nodes. 

The respective results of the divisions are determined as 
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the respective 

numbers of executions of respective paths input to the 
node. In the example of 

nodes xx and yy, the following allocation is performed. 

(15) The number of executions for the paths from node D 
to node xx is 

allocated as 40/2=20 (40 total executions at xx/2 paths 
inputting to xx) . 

(16) Similarly the number of executions for the paths 
from node E to node xx 

is allocated as 40/2=20. 

(17) The number of executions for the paths from node B 
to node yy is 

allocated as 60/3=20 (60 total executions at yy/3 paths 
inputting to yy) . 

(18) Similarly, number of executions for the paths from 
node C to node yy is 

allocated as 60/3=20. 

(19) Similarly again, the number of executions for the 
paths from node E to 

node yy is allocated as 60/3=20. 

(20) Process 6: For each intermediate node at which the 
numbers of the 

executions of all the paths output from intermediate node 
to other intermediate 

nodes is allocated, the sum of the numbers of the 
executions of all the 

outgoing paths is determined as the number of executions at 
the intermediate 

node. In the example shown above, the number of the 
executions for the paths 

from node E to node xx is 20, the number of the executions 
for the path from 

node E to node yy is 20, and the number of paths output 
from node E is only 2 . 

Accordingly, the number of executions at node E is 20+20=40 
(times) . Then, the 

number of execution£--at— fcfee intermediate— riode, here node E, 
is divided by the 

number of paths input to the intermediate node, here node 
E, result of the 

division is determined to be the number of executions of 
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each respective paths 

input to the intermediate node . 

(21) Process 7: The above process 6 is repeated for each 
intermediate node 

and thus the numbers of executions along all the paths are 
calculated. As a 

result, the flow graph 2 04b shown in FIG. 2 is obtained. 

(22) Process 8: The numbers of executions along the 
respective paths are 

respectively divided by the number of executions at the 
node from which 

respective paths are outputted, thereby calculating the 
branch probabilities of 

respective paths. For example, the branch probability from 
the node A to node 

C is 40/100 =0.4 and the branch probability from the node B 

to node C is 

40/60=0.66. 

(23) By performing the above explained flow graph 
creating process 202 and 

branch probability determining/allocating process 203, the 
respective branch 

probabilities can be determined for respective branch 
conditions in a complex 

IF statement, thereby enabling precise optimization 
scheduling to be performed 
in a compilation process. 

(24) For example, performing a precise optimization based 
on the trace 

scheduling method, by selecting an optimum path with a high 
branch probability, 

from among paths branched under the individual branch 

conditions in the complex 

IF statement becomes possible. 

(25) Additionally, performing an optimization to allow 
resources, such as a 

register to be assigned with a high priority to the path 
with the higher branch 

probability from among all the paths branched under a 
respective branch 

conditions in the complex IF statement becomes possible. 

(26) When the branch probability of a path of a branch 
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occurring under 

respective branch conditions in the complex IF statement 
exceeds 50 percent 

significantly, the logic of the branch condition may be 
reversed and the 

instruction train to be executed, when the branch occurs is 
replaced by an 

instruction train to be executed when the the branch does 
not occur, thereby 

shortening execution time statistically. 

(27) Time required for a compilation of the program is \ 
shortened by ] 
decreasing the optimization level of the path, determined/ 
to have a low / 
execution frequency based on the branch probability 
allocated as described 

above. Conventionally, the optimization level is changed 
during a compilation 

process with regard to a specific program but the 
optimization level of the 

specific path in a single program is not changed based on 
the frequency of the 

executions. In order to perform the optimization, analysis 
the flow of control 

of the program and the flow of the data is necessary. The 
higher the level of 

optimization planned, the more precise an analysis is 
required, resulting in a 

longer time required for the compilation process. However, 
the path with a low 

frequency of execution is not executed frequently, as a 
result even if the 

optimization level is low for such a path, this does not 
contribute to an 

increase in the total execution time of the program. In 
some cases shortening 

the compilation time of the program is more beneficial than 
shortening the 

execution time of the program. Accordingly, the time 
required for the 

optimization process, which may be time-consuming, may be 
shortened by 

decreasing the optimization level of the path along with a 
low number of 

executions based on the allocated branch probability. 

(28) FIGS. 3A, 3B and 3C show explanatory view of another 
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principle of the 
present invention . 

(29) The above explanation is applied to the case where 
the ELSE clause 

exists in the complex IF, respectively statement. As shown 
in a program 301 in 

FIG, 3A, the same explanation made above may be applied 
where the ELSE clause 

does not exist- In this case, the probability at which the 
node zz is executed 

is actually 100 percent. However the branch probability is 
calculated under 

the assumption that node zz is executed at a probability of 
6 0 percent when the 

branch probability is calculated based on the principle 
recited above. As a 

result, as shown in a flow graph 3 02 in FIG. 3B, the 
numbers of executions 

travelling through respective paths are calculated and the 
branch probabilities 

of respective branch conditions are calculated. 

(30) In the example of the program 2 01 shown in FIG. 2, 
the branch conditions 

D and E are deleted and the control flow is changed when 
branch condition D is 

always true in the compilation process subject to the 
optimization. In this 

case shown in FIG. 3B, the control flow is examined 
immediately before the 

calculation of the branch probabilities. The the branch 
probabilities of 

respective branch conditions can be allocated correctly as 
shown in flow graph 
303 in FIG. 3C. 

(31) Explanation of Preferred Embodiments 

(32) The preferred embodiments of the present invention 
based on the above 

described operational principle is now explained. 

(33) FIG. 4 shows a structure diagram of a preferred 
embodiment of the 

present invention . 

(34) A processing apparatus 401, allocated with a CPU and 
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memory, executes a 
compilation process, 

(35) An information collecting phase processing unit 402 
executes a phase for 

collecting compilation information to create a flow graph, 

(36) A connection matrix creating phase processing unit 
403 executes a phase 

for creating a connection matrix representing a flow graph. 

(37) A connection matrix calculating phase processing 
unit 4 04 executes a 

phase for calculating branch probabilities (truth ratio) 

using a connection 

matrix. 

(38) The above processing unit can be realized as a 
function of part of a 

compiler for executing the compilation process. 

(3 9) The detail of the phase to be executed by the 
respective processing 

units is now explained according to the operational 
flowchart shown in FIGS. 5 

to 7. FIG. 8A shows an explanatory view of the data 
structure of the basic 

block; FIG. 8B shows an explanatory view of the dummy 
block; and FIGS. 9A, 9B, 

9C and 9D show a view of an example of the connection 
matrix . 

(40) Information collecting phase processing unit 402 
executes the process of 

steps 501 to 505 designated by an operational flowchart 
shown in FIG. 5. 

(41) The program, which is converted to an intermediate 
code by structure 

analysis and semantics analysis in a compilation process, 
is arranged in units 

of a basic block (which is merely called a block 
hereinafter) with regard to 

respective blocks, control flow information concerning a 
preceding block and 

concerning a succeeding block of the respective blocks can 
be obtained. 
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(42) FIG . 8A shows the data structure of a block (basic 
block) . The block 

has the data structure comprising a pointer to a list 
designating a parent 

block, a pointer to a list designating a child block, a 
pointer designating an 

instruction at the head of the block, a pointer designating 
an instruction at 

the end of the block and an attribute area. Control flow 
information relating 

to the preceding block and the succeeding block of the 
particular block 

obtained in step 5 01 is allocated respectively as a pointer 
to a list 

designating the parent block and a pointer to a list 
designating the child 

block in the structure defining the particular block is 
shown in FIG. 8A. 

(43) In step 502 in FIG. 5, a distinguishing mark is 
allocated at a block in 

which the branch condition in the complex IF statement 
comprising the branch 

probability designation is included. The distinguishing 
mark distinguishes the 

complex IF statement, including the branch condition of the 
block at which the 

distinguishing mark is allocated and actually comprises a 

unique address or 

flag. 

(44) In step 503, a THEN attribute is allocated at a 
block representing a 

THEN clause (which is called a THEN block hereinafter) and 
an ELSE attribute is 

allocated to a block representing an ELSE clause 
(hereinafter called an ELSE 
block hereinafter) . 

(45) A distinguishing mark, a THEN attribute or an ELSE 
attribute for a 

particular block allocated in steps 502 and 503 is 
allocated to an attribute 

area of the structure which defines the particular block 
shown in FIG. 8A. 

(46) Instead of allocating the attribute area in the 
structure of the block 
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as shown in FIG. 8A, a branch condition of a complex IF 
statement and a dummy 

instruction representing a THEN clause or an ELSE clause 
(which is represented 

as dummy- true or dummy- false in FIG. 8B) may be inserted as 
the block as shown 
in FIG. 8B. 

(47) In step 504, a truth ratio of the branch probability 
of the complex IF 

statement designated by the optimization designating 

statement (statement 1 in 

FIG. 2, for example) is obtained. 

(48) In step 505, a structure information table is 
prepared for the complex 

IF statement. The structure information table includes the 

following 

information; 

(49) Designated truth ratio 

(50) Pointers to structures of blocks including branch 
conditions and the 

number of pointers 

(51) A pointer to the structure of a THEN block 

(52) A pointer to the structure of an ELSE block 

(53) The above described process in the information 
collecting phase 

processing unit 402 corresponds to process 1, whereby the 
process is executed 

as part of the flow graph creating process 2 02 explained as 
the principle of 

the present invention by referring to FIG. 2. 

(54) Sequentially, the connection matrix creating phase 
processing unit 403 

executes the processes shown in the operational flowcharts 
in FIGS. 6A and 6B. 

(55) As shown in step 601 in FIG. 6A, the information 
collecting phase 

processing unit 402 refers to the structure information 
table prepared in step 

505 of FIG. 5. Step 602 creates a connection matrix having 



10/03/2003, EAST Version: 1.04.0000 



rows corresponding 

to n blocks, each including a branch condition, a THEN 
block and an ELSE block, 

and having columns corresponding to the n blocks, each 
including a branch 

condition with regard to each complex IF statement. For 
example, the 

connection matrix created from program 2 01 shown in FIG. 2 

is reflected in FIG. 

9A. 

(56) A block corresponding to a row in the connection 
matrix is called a row 

block and the block corresponding to a column is called a 
column block, and the 

portion where the row and the column intersect is called an 
element . 

(57) When the connection matrix is created in the step 
602, as an initial 

state of the connection matrix, the respective elements of 
the connection 

matrix have been allocated with an IP attribute, IS 
attribute, or DISCONNECT 

attribute. The IP attribute designates that the column 
block corresponding to 

the element precedes the row block in the control flow the 
IS attribute 

designates that the column block corresponding to the 
element succeeds the row 

block in the control flow, and the DISCONNECT attribute 
designates that the 

column block corresponding to the element is not related to 
the row block 

corresponding to the element in the control flow. The 
DISCONNECT attribute is 

represented by mark X in FIGS. 9A to 9B . 

(58) A further detailed operational flowchart of the 
connection matrix 

creating process shown in step 602 is represented in FIG. 
6B. 

(59) In step 603, the number n of blocks in which the 
branch condition is 

included is obtained. 

(60) In step 604, the array MATRIX (n+2, n) , where the 
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number of the row 

element is n+2 and the number of the column element is n, 
is generated. This 

array MATRIX represents a connection matrix. 

(61) In step 605, an attribute of all the elements of the 
array MATRIX is 

allocated as the DISCONNECT attribute at the initial state 
of preparing the 
array MATRIX. 

(62) Accordingly, the value of index i is incremented 
from 1 to n+2 by 1, in 

steps 606, 612 and 613 and the value of index j is 
incremented from 1 to n by 1 

with regard to the value of indexes i in steps 607, 610 and 
611, thereby 

repeating the execution of the following steps 608 and 609. 

(63) In step 608, it is determined whether the preceding 
block of i-th block 

is the j-th block. If the determination is YES, in step 
609, the IP attribute 

is allocated at the array element MATRIX (i, j) and the IS 
attribute is 

allocated at the array element MATRIX (j, i) . 

(64) When the processes for all the array elements are 
completed, the 

connection matrix creating phase processing unit 403 has 
completed its 

operation. As a result, an array MATRIX (n+2, n) 
corresponding to the 

connection matrix shown in FIG. 9A is obtained. 

(65) The process performed by the connection matrix 
creating phase processing 

unit 403 corresponds to the process 2, executing as part of 
the flow graph 

creating process 202 in the explanation of the principle of 
the present 

invention by referring to FIG. 2. 

(66) Accordingly, the connection matrix calculating phase 
processing unit 404 

executes the process, designated by the operational 
flowchart shown in FIG. 7. 
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(67) In step 701, respective numbers of paths inputted to 
the THEN block and 

the ELSE block are detected by referring to the structure 
information table and 

the structure of the block designated by the structure 
information table. The 

number of paths inputted to the THEN block is expressed as 
then. sub. -- 

path. sub.-- cnt and the number of paths inputted to the 
else block is expressed 

as else. sub.-- path. sub.-- cnt. This process corresponds 
to the process 4 

which is executed as part of the branch probability 
determining/allocating 

process 203 in the explanation of the principle based on 
FIG. 2. 

(68) Next, in step 702, the number of executions to be 
calculated by the 

following equation is allocated in the array element MATRIX 
(n+1, i) (i=l, . 

. , n) where the IP attribute is set. 

(1) The number of executions=the truth 

(69) is set in the structure information table and is 
designated for the 

entire complex IF statement which is the subject of the 
process, in accordance 

with an optimization designating statement. 

(70) Sequentially, in step 703, the number of executions 
to be calculated by 

the following equation is allocated in the array element 
MATRIX (n+2, i) (i=l, 

. . , n) in which the IP attribute is set. 

(2) The number of executions= ( 100- the truth 
ratio) /else . sub . - - path . sub . - - 

cnt . 

(71) The above-described steps 702 and 703 correspond to 
process 5 which can 

be executed as a part of the branch probability 
determining/allocating process 

203 in the explanation of the principle based on FIG. 2. 
As a result, the 

array MATRIX (n+2, n) corresponding to the connection 
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matrix shown in FIG. 9B 
can be obtained. 



(72) Sequentially, in step 704, the following steps 704-1 
to 704-4 are 

executed for the column in which the IP attribute does not 
exist and the IS 

attribute exists, namely, for a block in which the number 
of executions of all 

the paths outputted from the block to the other succeeding 
blocks is allocated. 

(73) In step 704-1, the sum of the numbers of respective 
executions 

sum. sub.-- cnt is calculated for elements in which the 
numbers of the 

executions are stored, along the column. For example, with 
regard to the 

column E shown in FIG. 9B, the sum, sub.-- cnt=20+20=40 . 

(74) In step 704-2, the number of elements which have the 
IS attribute along 

the column is calculated as element . sub . - - cnt. For 
example, with regard to 

the column E shown in FIG. 9B, the element . sub .- - cnt=l. 

(75) In step 704-3, the value of element (the number of 
executions) which 

have the IS attribute along the column is calculated based 
on the following 

equation and is allocated for the element. 

(3) The number of executions=sum. sub . - - 
cnt/element . sub . - - cnt . 

(76) For example, in FIG. 9B, the number of executions 40 
is allocated for 

the element (D,E). 

(77) In step 704-4, the number of executions calculated 
in step 704-3 are 

allocated for a diagonal element of the element in which 
the process of step 

7 04-3 is executed. For example, in FIG. 9B, the number of 
executions 40 which 

is equal to the number of executions allocated for the 
element (D, E) is 

allocated for the diagonal element (E, D) of the element 
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(D, E) . The diagonal 

element always has the IP attribute. 

(78) The process of the above described step 704 
corresponds to the process 6 

which is executed as a part of the branch probability 
determining/allocating 

process 203 in the explanation of the principle by 
referring to FIG. 2. 

(79) In step 705, the process of the step 704 is repeated 
until all the 

elements having the IP attribute or IS attribute do not 
exist. As a result, 

the array MATRIX (n+2, n) corresponding to the connection 
matrix shown in FIG. 

9C is obtained. This process corresponds to the process 7 
which is executed as 

a part of the branch probability determining/allocating 
process 203 in the 

explanation of the principle by referring to FIG. 2. 

(80) In step 706, only the value of the array element in 
which the IP 

attribute is originally allocated is left and the value of 
the array element at 

which the IS attribute is originally allocated is deleted. 
As a result, array 

MATRIX (n+2, n) corresponding to the connection matrix 
shown in FIG. 9D, is 

obtained. The connection matrix corresponds to the flow 
graph 2 04b shown in 
FIG. 2. 

(81) Finally, in step 707, the value obtained by dividing 
respective values 

of the elements by the sum of the values of the elements 
along the column is 

allocated for the respective elements of each of the 
columns . The value of 

each element shows the branch probability of the path from 
the block 

corresponding to the column in which the element is 
included to the block 

corresponding to the row in which the element is included. 
This process 

corresponds to the process 8 which is executed as part of 
the branch 
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probability determining/allocating process 2 03 in the 
explanation of the 

principle of the present invention by referring to FIG. 2. 

(82) The above described connection matrix calculation 
phase processing unit 

404 enables the branch probabilities to be determined for 
the respective branch 

conditions in the complex IF statement including a 
plurality of branch 

conditions and also enables the precise optimization 
scheduling in the 

compilation process to be performed. 

(83) An Explanation of Other Embodiment 

(84) In the above embodiment, as shown in the program 2 01 
of FIG. 2 for 

example, the values corresponding to the numbers of 
executions of respective 

paths are calculated and the branch probabilities of the 
respective branch 

conditions are assigned to the nodes corresponding to the 
respective branch 

conditions so that the whole branch probability (truth 
ratio=40 percent) of the 

complex IF statement of the statement 2 designated by the 
optimization 

designating statement of the statement 1 is satisfied, 

(85) In contrast, in FIG. 10, the example of the program 
is shown where the 

respective truth ratios are determined for the respective 

branch conditions of 

the complex IF statement. 

(86) In this case, after the complex IF statement is 
resolved into respective 

branches and marks are allocated to the respective 
branches, the respective 

truth ratios designated are allocated for respective 
branches . 

(87) In the program 1001 shown in FIG. 10, 0.4 is 
allocated as the truth 

ratio of the branch condition A, 0.2 as the truth ratio of 
the branch condition 

B, 0.9 as the truth ratio of the branch condition C, 0.8 as 
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the truth ratio of 

the branch condition D and 0.4 as the truth ratio of the 
branch condition E. 



CLAIMS : 

What is claimed is: 

1. A compilation processing method for performing a 
translation process for 

a program including at least one branch statement, using 
initial branch 

probability information for the at least one branch 
statement, the at least one 

branch statement having a plurality of branch conditions, 
said compilation 

processing method comprising the steps of: 

providing nodes, with regard to a branch statement, each 
node being one of a 

branch node representative of a corresponding branch 
condition in the branch 

statement, a first basic block representative of code to be 
executed when a 

decision of the branch statement is true, and a second 
basic block 

representative of code to be executed when the decision of 
the branch statement 
is false; 

creating a flow graph by determining a path from each 
node to each 

corresponding node, each path being input to one of the 

nodes and output from 

another one of the nodes; and 

determining final branch probability information for 
each path from each 

branch condition in the branch statement based on the flow 
graph, each path 

having corresponding branch probability information, said 
corresponding branch 

probability information indicative of a frequency that a 
specified path is 

executed such that the branch probability information for 
the plurality of 

branch conditions in the branch statement initially 
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corresponds to the initial 

branch probability information for the branch statement. 

2. The compilation processing method according to claim 
1, further 

comprising : 

decreasing an optimization level of the path 
corresponding to a selected 

branch condition of the plurality of branch conditions, 
said selected branch 

condition being estimated as having a low execution 
frequency based on the 

determined branch probability information determined for 

said corresponding 

path, 

3 . A compilation processing method for performing a 
translation process for 

a program including at least one branch statement, using 
initial branch 

probability information for the at least one branch 
statement, the at least one 

branch statement having branch conditions, said compilation 
processing method 
comprising the steps of: 

providing nodes, with regard to a branch statement, each 
node being one of a 

branch node representative of a corresponding branch 
condition in the branch 

statement, a first basic block representative of code to be 
executed when a 

decision of the branch statement is true, and a second 
basic block 

representative of code to be executed when the decision of 
the branch statement 
is false; 

creating a flow graph by determining paths from each 
node to each 

corresponding node, each path being input to one of the 

nodes and output from 

another one of the nodes; and 

calculating final branch probabilities for the nodes, 
the final branch 

'probability being calculated for each node based on a 
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number of paths incident 

to the node in the flow graph, the initial branch 
probability of each path 

incident to a selected and corresponding node, each node 
having a corresponding 

branch probability for the selected and corresponding node, 
such that the 

branch probabilities of the incident paths of the nodes of 
said branch 

statement correlate to designated branch probability 
information for the branch 
statement . 

4, The compilation processing method according to claim 
3, further 

comprising: 

decreasing an optimization level of a path corresponding 
to a selected 

branch condition of the branch conditions, said selected 
branch condition being 

estimated as having a low execution frequency based on the 
calculated branch 
probability information. 

5. A compilation processing method for performing a 
translation process for 

a program including at least one branch statement, using 
initial branch 

probability information for the at least one branch 
statement, the at least one 

branch statement having branch conditions, said compilation 
processing method 
comprising the steps of: 

providing nodes, with regard to a branch statement, each 
node being one of a 

branch node representative of a corresponding branch 
condition in the branch 

statement, a truth condition block representative of code 
to be executed when a 

decision of the branch statement is true, and a false 
condition block 

representative of code to be executed when the decision of 
the branch statement 
is false; 

creating a flow graph by determining a path from each 
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node to each 

corresponding node, each path being input to one of the 

nodes and output from 

another one of the nodes; and 

allocating respective values corresponding to numbers of 
executions for the 

node corresponding to said truth condition block and for 
the node corresponding 

to said false condition block/ respectively, each of the 
respective values 

calculated by a ratio based on the initial branch 

probability information 

designated for the branch statement; 

dividing, for each respective node, the value 
corresponding to the number of 

executions allocated for the respective node by the 
respective number of paths 

input to the respective node and producing a result; 

allocating the results of the respective division as the 
respective values 

corresponding to the numbers of executions of the 
respective paths input to the 

respective nodes, for the node which is the truth condition 
block and the node 

which is the false condition block; 

determining a sum of the values corresponding to the 
numbers of executions 

for all of the paths incident to the respective node and 
allocating the sum as 

a corresponding value for the number of executions of the 
respective node; 

dividing the respective values corresponding to the 
numbers of executions 

allocated for the respective node by the number of paths 
input to the 

respective node, and allocating a result of the division as 
the respective 

values corresponding to the numbers of executions of 
respective paths input to 

the node, for each of the nodes of the branch statement; 
and 

calculating a final branch probability of each 
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respective paths of the flow 

graph as the final branch probability of the branch 
condition corresponding to 

the node from which the path is output, by dividing the 
respective values 

corresponding to the number of executions allocated for the 
respective paths by 

the corresponding value for the number of executions 
allocated for the 

respective node from which each respective path is output. 

6. The compilation processing method according to claim 
5, further 

comprising: 

decreasing an optimization level of the path 
corresponding to a selected 

branch condition of the branch conditions estimated as 
having a low execution 

frequency based on the determined branch probability 
information for said 
corresponding path. 

7. A compilation processing apparatus for performing a 
translation process 

for a program including at least one branch statement, 
using initial branch 

probability information for the at least one branch 
statement, the at least one 

branch statement defining conditional decisions, said 
compilation processing 
apparatus comprising : 

information collecting phase processing means for 
extracting nodes, each 

node being one of a branch node representing respective 
branch conditions in a 

branch statement, a first basic block executed when a 
corresponding conditional 

decision in the branch statement is true and a second basic 
block executed when 

the corresponding conditional decision in the branch 
statement is false, each 

node executed in a determined order, for determining a path 
from each branch 

node to each corresponding node, and for collecting 

information concerning a 

preceding node and a succeeding node; 
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block connection relation data forming means for 
forming, based on the 

information collected by said information collecting phase 
processing means, 

block connection relation data having a plurality of 
elements, each element 

indicative of a corresponding connection relation between 

respective nodes; 

and 

block connection relation data calculating means for 
calculating, based on 

the block connection relation data, a final branch 
probability for each of the 

paths corresponding to the respective elements, as final 
branch probabilities 

of the branch conditions included by a corresponding branch 
node from which the 

respective paths are output such that the final branch 
probabilities correspond 

to designated branch probability information for the branch 
statement . 

8 . A compilation processing apparatus for performing a 
translation process 

for a program upon execution of the program, using initial 
branch probability 

information for at least one branch statement defining 
conditional decisions in 

the program, said compilation processing apparatus 
comprising: 

information collecting phase processing means for 
extracting program code 

blocks, each program code block being one of a branch block 
including 

respective branch conditions in a branch statement, a first 
basic block 

executed when a corresponding conditional decision in the 
branch statement is 

true and a second basic block executed when the 
corresponding conditional 

decision in the branch statement is false, the program code 
blocks executed in 

a determined order, for determining paths, a path 
determined from each branch 

block to each corresponding program code block, and for 
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collecting information 

concerning a preceding program code block of respective 
program code blocks and 

information concerning a succeeding block of the respective 

program code 

blocks; 

block connection relation data forming means for 
forming, based on the 

information collected by said information collecting phase 
processing means, 

block connection relation data having a plurality of 
elements, each element 

indicative of a corresponding connection relation between 
respective program 
code blocks; and 

block connection relation data calculating means for 
calculating, based on 

the block connection relation data and the paths between 
the respective program 

code blocks, a branch probability for each of the paths 
corresponding to each 

element, as final branch probabilities of the branch 
conditions included by the 

respective program code blocks from which the paths are 
output such that the 

branch probabilities for the plurality of paths correspond 
to designated branch 

probability information for the branch statement. 

9. A compilation processing apparatus for performing a 
translation process 

for a program, the program including at least one branch 
statement and branch 

probability information for the at least one branch 
statement, the at least one 

branch statement defining conditional decisions, said 
compilation processing 
apparatus comprise: 

information collecting phase processing means for 
extracting basic blocks, 

each basic block being one of a first basic block including 
respective branch 

conditions in a branch statement, a second basic block 
executed when decision 

of the branch statement is true and a third basic block 
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executed when decision 

of the branch statement is false, each said basic block 
executed in a 

determined order, for determining a path from each basic 
block to each 

corresponding basic block, and for collecting concerning a 
preceding basic 

block of the respective first, second and third basic 
blocks and information 

concerning a succeeding basic block of the respective 
first, second and third 
basic blocks; 

block connection relation data forming means for 
forming, based on the 

information collected by said information collecting phase 
processing means, 

block connection relation data having a plurality of 
elements, each element 

indicative of a corresponding connection relation between 

respective basic 

blocks; 

first block connection calculating means for allocating, 
in the block 

connection relation data, a first value corresponding to a 
first number of 

executions, by dividing a second value corresponding to a 
second number of 

executions allocated for the truth condition block, 
calculated based on a 

branch probability designated for the branch statement, by 
the number of 

elements corresponding to all the paths input to the truth 
condition block, 

each element designating one of the paths which is input to 

the truth condition 

block; 

second block connection calculating means for 
allocating, in the block 

connection relation data, a third value corresponding to a 
third number of 

executions, by dividing a fourth value corresponding to a 
fourth number of 

executions allocated for the false condition block, based 
on the branch 

probability designated for the branch statement, by the 
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number of elements 

corresponding to all the paths input to the false condition 
block, each element 

designating one of the paths which is input to the false 
condition block; 

third block connection calculating means for allocating, 
in the block 

connection relation data, a fifth value corresponding to a 
fifth number of 

executions, by dividing a sum of sixth values corresponding 
to sixth numbers of 

the executions allocated for each of the respective 
elements corresponding to 

all the paths output from a selected one of the basic 
blocks by the number of 

elements representing the paths input to the selected basic 
block, for each 

element designating the path input to the selected basic 
block, for each of the 

basic blocks of the branch statement; and 

fourth block connection calculating means for 
calculating, a branch 

probability for each respective element of the block 
connection relation data, 

as the branch probability of the corresponding branch 
condition included by the 

basic block from which the path corresponding to each 
element is output by 

dividing a seventh value corresponding to a seventh number 
of executions 

allocated for each element by a sum of eighth values 
corresponding to eighth 

numbers of executions allocated for the elements 
corresponding to all of the 

paths output from the basic block from which the paths 
corresponding to the 
respective element are output. 

10. A compilation processing method for performing a 
translation process 

for a program including at least one branch statement, 
using initial branch 

probability information for the at least one branch 
statement, the at least one 

branch statement defining a plurality of conditional 
decisions, said 
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compilation processing method comprising the steps of: 

creating a flow graph by resolving the plurality of 
conditional decisions 

defined by the at least one branch statement into 
respective conditional 
decisions; and 

determining branch probability information for each 
respective conditional 

decision in the flow graph based on the flow graph and the 
initial branch 

probability information to to obtain final branch 

probability information for 

the at least one branch statement . 

11. A compilation processing method for performing a 
translation process 

for a program including at least one branch statement, 
using initial branch 

probability information for the branch statement, the 
branch statement defining 

a plurality of conditional decisions, said compilation 
processing method 
comprising the steps of: 

creating a flow graph by resolving the at least one 
branch statement into 

respective conditional decisions; and 

allocating branch probability for each respective 
conditional decision in 

the flow graph, based on the flow graph and the initial 
branch probability 

information to to obtain final branch probability 
information of of the branch 
statement . 

12. The compilation processing method according to 
claim 11 wherein: 

said creating of a flow graph further determines paths 
for the conditional 
decisions; and 

said allocating of branch probability allocates branch 
probabilities for 
each path; and 
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further comprising the steps of decreasing an 
optimization level of a 

selected path of the paths which is determined to have a 
low execution 

frequency based on a respective branch probability for the 
path. 

13, A compilation processing apparatus for performing a 
translation process 

for a program including at least one branch statement, 
using initial branch 

probability information for the branch statement, the 
branch statement defining 

a plurality of conditional decisions, said compilation 

processing apparatus 

comprising: 

information collecting phase processing means for 
resolving the at least one 

branch statement, into respective conditional decisions 
allocating, for each of 

the respective condition decisions, a succeeding basic 
block, and collecting 

information for creating a flow graph; 

connection matrix creating phase processing means for 
creating a connection 

matrix representing the flow graph designating connection 
relations relating to 

branch nodes and succeeding basic blocks, based on the 

collected information; 

and 

connection matrix calculating phase processing means for 
calculating a ratio 

of a number of executions in accordance with the number of 
paths from the 

succeeding basic block to one of the branch nodes, based on 
the initial branch 

probability information by referring to the created 
connection matrix, and 

allocating respective branch probability information for 
each of the respective 
conditional decisions. 
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METHOD OF PERFORMING A 
COMPILATION PROCESS FOR 
DETERMINING A BRANCH PROBABILITY 
AND AN APPARATUS FOR PERFORMING 
THE COMPILATION PROCESS 

This application is a continuation, of application Scr. No. 
08/396590. filed Mar. 1. 1995. now abandoned, which is a 
continuation of Ser. No. 08/113.907, filed Aug. 31. 1993, 
now abandoned. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a method of performing 
a compilation process to optimize execution speed of a 
program, by deterxnining the branch probability of a branch 
statement in a program which includes a plurality of branch 
conditions, and further to an apparatus for performing the 
compilation process. 

2. Description of the Related Art 

Increasing the speed of a process of a computer system, 
is being sought a compiler for translating a program at a high 
execution efficiency has been demanded. Such a function of 
the compiler is called optimization. 

A VLIW (very long instruction word) computer capable 
of executing a plurality of instructions in parallel by merely 
designating a single long instruction word at every instruc- 
tion execution cycle has been recently noted. 

When a program to be executed by the VLIW computer 
is compiled, a plurality of instructions able to be executed in 
parallel, because they do not share an arithmetic operating 
unit or datapath and do not require any results of operations 
performed by the other instructions, may be designated by a 
single long instruction word. As a result optimization is 
achieved. By performing such optimization, the rate of 
executing respective instructions of a program to be per- 
formed in parallel can be increased and the execution time 
of a program can be shortened. 
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of each branch is obtained. Based on the number of repeti- 
tions in the loop and the branch probability, a path with a 
large number of executions is selected and a particular trace 
is selected Thus, a plurality ofjnstnictions can be executed 
in parallel by ljismg^tsjpf_fte,n- ace_ which_ovemde s_a 
plurality of basic blocks apdhasa larger number of execu- 
ti ons. CohsequentryTffieTaSo of instructions executable in 
'parallel can be increased and the execution speed of me 
program can be improved. 

Therefore, it is very important to obtain the number of 
repetitions of a loop instruction and a branch probability of 
the branch instruction. 

Additionally, it is useful to obtain the probability of the 
branch instruction. 

Namely, for example, the branch probability of each 
branch instruction is necessary to detect a path with a high 
execution frequency when an optimization is performed, to 
enable the resources, such as a register, to be assigned with 
a high priority to a path having a higher execution frequency 
when performing the compilation. 

When a branch instruction, occurs the execution cost 
becomes higher than if it were not chosen. The execution 
time becomes longer, as compared to the case of non- 
occurrence of the branch, because an instruction cycle for 
setting a branch target address in a program counter is added 
.when a branch occurs. 

Therefore, if the probability of the occurrence of the 
branch at the branch instruction greatly exceeds 50 percent 
the logic of the branch condition is reversed andj heinstrac- 
tion train tobe executed, in the case of the occur rence of the 
branch, is repl aced by the instruction train_toJ>e-executed 
whe n the branch does not occur to shorte n thft execution 
time^ statistically. When the necessity of the above 
cment. is to be determined the branch probability of 
it branch instruction may also be required. 
Further, in order to shorten the time required for the 
compilation to perform an optimization, the level of opti- 
mization may be lowered for a path having a low frequency 



a program can be shortened. 40 0 f execution. In such a case, a branch probability of the 

The above technical background requires technology for instTUCtion ^ ^ required to determine the execu- 
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performing the compilation, and detecting, from a program 
to be processed a plurality of instructions, able to be 
executed in parallel, because they do not share an arithmetic 
operating unit or a data path and do not require any results 
of operations by the other instructions. 

Generally speaking, when a branch instruction exists in 
the program which is compiled, as instruction succeeding 
the branch instruction is not always executed depending on 
the branch condition. Therefore, it is desired quite often to 
detect a plurality of instructions executable in parallel with 
a group of instructions, allocated between two branch 
instructions called a basic block. However, an optimization 
performed for respective basic blocks, merely enables, at 
man'mum. two instructions to be executed as a result of a 5S 
statistical average. 

Effective methods for solving such a bottleneck have been 
proposed, (see. e.g.. a method described as trace scheduling 
by J. R. Ellis. Bulldog: A Compiler for VLIW Architecture^ 
the MIT Tress. 1986). \ 60 

According to the trace scheduling method, a train of » 
instructions (called trace), which have many execution 
times, is detected in a program the instructions in the trace 
are sorted and parallelized to enable the trace to be executed 
with high efficiency. In this case, when a loop instruction 
appears the number of repetitions of the loop is determined, 
similarly, when a branch instruction appears, the probability 
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branch instruction may be required to determine the execu 
tion frequency of respective paths. For conventional opti 
mization in a compilation process, the number of repetitions 
of the loop instructions is frequently shown in a source 
program and can be obtained with relative ease. 
T in contrast, the branch probability of the branch instruc- 
tion can be determined for each respective branch instruc- 
tions by the branch instruction being designated in the 
s ource progr am b y a prp jaammer or by using an automati c 
profile function resident in the computer system. The auto- 
matic profile function can measure the executi on.ratio of an 
instr uction bv compili np the program wimc^.pejfcrming 
th e optimizati on. 

When the branch instruction includes only a branch 
condition, the execution frequency of a path corresponding 
to the branch instruction can be determined immediately 
after the branch probability is deterrnined. 

In contrast, the prior art of a compilation process accom- 
panied by an optimization when the branch instruction 
includes a plurality of branch conditions will be explained 
by the following FIGS. LA through 1C 

The program 111 shown in FIG. 1A is written in a C 
language. When one of the branch conditions A. B or C is 
true, a basic block xx in a THEN clause is executed (the 
word "THEN** is implicit), and if one of the branch condi- 
tions is not true, a basic block yy in an ELSE clause is 
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executed and the basic block zz is executed after the meot being true and a node representing a basic block to be 

completion of either of these clauses. A method of perform- executed where the conditional decision results in the branch 

ing a compilation process in the following two cases (FIGS. statement being false. This flow graph can be created as 

IB and 1C) has been known with regard to an IF statement block connection relation data (corresponding to the con- 

(which is called a complex IF statement) including a plu- s nection matrix table) having the connection relation between 

rality of branch conditions in a statement representing one respective basic blocks as the elements, 

branch* as shown in this example. The branch probability of the path between respective 

As shown in FIG. IB. in a first compilation process, the nodes can be calculated as the branch probability of the 

plurality of branch conditions A. B and C. are calculated as branch condition included with the basic block correspond- 

a group. This first compilation process is advantageous in 10 ing to the node from which the path is outputted, based on 

that the branch probability of a complex IF statement can be the number of paths between respective nodes, to satisfy the 

determined with ease since the branch decision is performed branch probability information designated for the whole 

at a single place. A document corresponding to this prior art branch statement 

is known as "A Method of Selecting Instructions Provided According to the construction of the above present 

with Conditions upon Executing the Instructions** disclosed 15 invention, it is possible to decrease the optimization level of 

in Japanese Patent Riblication Tokkobei 1-42019. the path to which the branch condition specifies branching 

However, in the prior art shown in FIG. IB, the branch to, when the path is determined to have a low execution 
probabilities cannot be obtained for the respective branch frequency based on the branch probability, 
conditions A. B and C. In the case of a basic block in which According to the construction of the present invention, it 
respective branch conditions A, B and C comprise a plurality 30 is possible to allocate the branch probability information for 
of instructions, the first compilation process cannot achieve each respective branch conditions in one branch statement 
a precise optimization by selecting an optimum path from and to predict a frequency of the execution each respective 
paths of the branches depending on respective branch con- instructions in the program with high precision when per- 
ditions, forming the compilation operation. The present invention 

As shown in FIG. 1C, in the second compilation process, 25 may be applied to performing scheduling based on the 

a decision involving a plurality of branch conditions is frequency of execution of the respective instructions, such as 

resolved into respective branch paths. Also, in the second the trace scheduling method. This thereby enables the com- 

compilation process, branch decisions are not performed at pilation process, for translating the program, to be per- 

one place but are performed for respective branch condi- formed and the program to be operated with high efficiency. 

T "^"^ f^.L ton *. CO °f, ti ™ £ 30 BRIEF DESCRIPTION OF THE DRAWINGS 
is true, the basic block xx m the THEN clause is immediately 

executed and the determination of the other branch condi- The other objects and features of the present invention can 

tions is not executed. Consequently, with regard to the same be easily understood by persons skilled in the art based on 

source program* an execution time of an executable program the description relating to the preferred embodiments of the 

generated based on the compilation process shown in FIG. 35 present invention together with the attached drawings. 

1C is statistically shorter than that of the executable program FIG. 1A shows an explanatory view of the prior art; 

generated based on the compilation process shown in FIG. . pxq, rjj shows an explanatory view of the prior art; 

FIG. 1C shows an explanatory view of the prior art; 

In the operations generated based on the compilation ^ pjQ 2 shows an explanatory view of principle of the 

process shown in FIG. 1C. a branch decision is performed present invention* 

for individual branch conditions in a complex IF statement mQ 3A ^oWan explanatory view of a second principle 

Thus it is possible to perform a precise optimization by of mc prcsent invention- 

SSS : te tTdivt3^£^ m 4 , ra-'Bshows a s^nd view of . second 

. 45 principle of the present invention; 

AC °?^ ? 5 d f t ^ FIG. 3C shows a third explanatory view of a second 

respective branch prcbah^tees ^fh principle of the present invention; 

conditions based on the branch probability allocated for the TZ^T A . _< fc , - ^ - 

, . . . . . r . w J ^ . , FIG. 4 shows a structural view of the preferred embodi- 

whole complex if statement However, the method of t of the t in ention* 

enabling determination of such branch probabilities is not w „ „ . \ ~ . . . . , 

known. FIG. 5 shows an operational flowchart in the information 

collecting phase in the p r ef err e d embodiment of the present 

SUMMARY OF THE INVENTION invention; 

An object of the present invention residing in the above R ^6A shows one part of an o^o^l^chanin* 

desm^ed^ckground. is to enable the branch probabilities 5 5 <*nne^on ™tnx creating phase of the preferred embodi- 

of the respective branch conditions to be allocated based on ment of the present invention; 

the branch probability allocated for the whole complex IF . FIG. 6B shows the further part of the operation flowchart 

statement and to predict the frequency of the execution with in a connection matrix creating phase of the preferred 

high precision so that an executable program with high embodiment of the present invention; 

efficiency is created. «o FIG. 7 shows an operational flowchart in the connection 

The present invention creates a flow graph, for the branch matrix calculating phase of the preferred embodiment of the 

statement representing one branch including a plurality of present invention; 

branch conditions. This flow graph comprises nodes, repre- FIG. 8A shows an explanatory view of the data structure 

seating basic blocks, including the respective branch con- °f me basic block; 

ditions in the branch statement The flow graph further 65 FIG. 8B shows an explanatory view of a dummy block; 

comprises a node representing a basic block to be executed FIG. 9A shows one view of the example of the connection 

where the conditional decision results in the branch state- matrix; 
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FIG. 9B shows a second view of the example of the allocated as 4$ and the number of executions at node yy is 

connection matrix; allocated as 60. 

FIG. 9C shows a third view of the example of the Process 4: The respective numbers of paths input to 

connection matrix respective nodes xx and yy are searched. In the example of 

™~ «~ ^ ' c - * *u 4.. 5 FIG. 2. two paths are input to node xx and three paths to 

FIG. 9D shows a fourth view of the example of me ™ • ^ 

a node yy. 

connection matrix; and Process 5: The numbers of executions at the respective 

FIG. 10 shows an explanatory view of an operation of node$ ^ ^ ^ by ^ respective numbers of 

another embodiment of the present invention. inputted to respective nodes. The respective results of 

DESCRIPTION OF THE PREFERRED 10 the divisions are determined as the respective numbers of 

EMBODIMENT executions of respective paths input to the node. In the 

Explanation of the Principle example of nodes xx and yy. the following allocation is 

The principle of the present invention will be explained. performed. 

In FIG. 2. program 2*1 is an example of a program The number of executions for the paths from node D to 

subject to a compilation process. In program 201. statement 15 node xx is allocated as 40/2=20 (40 total executions at xx/2 

1 designates an optimization designating statement, state- paths inputting to xx). 

meat 2 shows a complex IF statement, and xx in statement Similarly the number of executions for the paths from 

3 represents a train of instruction statements which should node B to node xx is allocated as 40f2=20. 

be executed when the branch condition of the complex IF The number of executions for the paths from node B to 

statement is true known as a THEN clause. (The word then 20 node yy is allocated as 6(Y3=20 (60 total executions at yy/3 

THEN is implicit in the complex IF statement.) yy in paths inputting to yy). 

statement 4 represents a train of instruction statements Similarly, number of executions for the paths from node 

which should be executed when the branch condition of the C to node yy is allocated as 6XV3=20. 

complex IF statement is false, knows as an ELSE clause. Similarly again, the number of executions for the paths 

The branch conditions A,B,CD and E. xx in the THEN 25 from node E to node yy is allocated as 60/3=20. 

clause and yy in the ELSE clause respectively comprise a Process 6: For each intermediate node at which the 

c ontinuous tra in of instructions which are stopped^ or numbers of the executions of all the paths output from 

branched mexefcccTu^s-is^aUed-a^ic^Jlc^ * intermediate node to other intermediate nodes is aHocated. 

In the program 201. the optimization designating state- the sum of the numbers of the executions of all the outgoing 

ment of statement 1 designates that the branch probability of 30 paths is determined as the number of executions at the 

the complex IF statement designated by statement 2, intermediate node. In the example shown above, the number 

specifically, the truth probability in which the THEN clause of the executions for the paths from node E to node xx is 20, 

designated by statement 3 (orxx) is executed, is 40 percent the number of the executions for the path from node E to 

In flow graph creating process 202. designated by the node yy is 20, and the number of paths output from node E 

following processes land 2. with regard to program 201. a 35 is only 2. Accordingly, the number of executions at node E 

plurality of branch conditions A, B, C, D and E. in the is 20*20=40 (times). Then, me number of executions at the 

complex IF statement designated by the statement 2. are not intermediate node, here node E. is divided by the number of 

treated as a group. Instead individual branch conditions are poms input to the intermediate node, here node E. result of 

separated, thereby creating a flow graph 204a which desig- the division is determined to be the number of executions of 

nates a control flow corresponding to the logical value of the 40 each respective paths input to the intennediate node, 

respective branch condition. Process 7: The above process 6 is repeated for each 

Process 1 : The complex IF statement designated by state- intermediate node and thus the numbers of executions along 

ment 2 is resolved into individual branch conditions. The all the paths are calculated. As a result, the flow graph 204fc 

THEN clause. ELSE clause and individual branch condi- shown in FIG. 2 is obtained. 

tions are allocated with respective marks. 45 Process 8: The numbers of executions along the respee- 

Process 2: Flow graph 204a is created for the branch tive paths arc respectively divided by the number of execu- 

conditions each allocated with the marks. tions at the node from which respective paths are outputted. 

The flow graph 204a created by the above recited pro- thereby calculating the branch probabilities of respective 

cesses 1 and 2 is comprised of intermediate nodes A. B. C. paths. For example, the branch probability from the node A 

D and E, respectively corresponding to branch conditions A. so to node C is 40/100 =0.4 and the branch probability from the 

B, C. D and E; a final node xx. corresponding to the THEN node B to node C is 40/60=0.66. 

clause xx; a final node yy corresponding to the ELSE clause By performing the above explained flow graph creating 

yy; and paths connected between the respective nodes. process 202 and branch probability detcnnining/allocating 

In the branch probability determining/allocating process process 203, the respective branch probabilities can be 

203. designated by the following processes 3 to 8, the values 55 determined for respective branch conditions in a complex IF 

corresponding to the numbers of executions by respective statement, thereby enabling precise optimization scheduling 

paths are calculated. Consequently the whole branch prob- to be performed in a compilation process, 

ability (true ratio equals 40 percent) of the complex IF For example, perfoiming a precise optimization based on 

statement of statement 2 designated by the optimization the trace scheduling method by selecting an optimum path 

designating statement of statement 1 is satisfied by the flow 60 with a high branch probability, from among paths branched 

graph 204a. Branch probabilities of respective branch con- under the individual branch conditions in the complex IF 

ditions are then assigned to the nodes corresponding to the statement becomes possible. 

respective branch conditions. Additionally, performing an optimization to allow 

Process 3: The number of executions is allocated for final resources, such as a register to be assigned with a high 

nodes xx and yy based on the designation by the optimiza- 65 priority to the path with the higher branch probability from 

tion designating statement of statement 1. In the example among all the paths branched under a respective branch 

shown in FIG. 2. the number of executions at node xx is conditions in the complex IF statement becomes possible. 
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When the brand) probability of a path of a branch 
occurring under respective branch conditions in the complex 
IF statement exceeds 50 percent significantly, the logic of 
the branch condition may be reversed and the instruction 
train to be executed, when the branch occurs is replaced by 
an instruction train to be executed when the the branch does 
not occur, thereby shortening execution time statistically. 

Time required for a compilation of the program is short- 
ened by decreasing the optimization level of the path, 
determined to have a low execution frequency based on the 
branch probability allocated as described above. 
Conventionally, the optimization level is changed during a 
compilation process with regard to a specific program but 
the optimization level of the specific path in a single 
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A connection matrix calculating phase processing unit 
404 executes a phase for calculating branch probabilities , 
(truth ratio) using a connection matrix. 

The above processing unit can be realized as a function of 
part of a compiler for executing the compilation process. 

The detail of the phase to be executed by the respective 
processing units is now explained according to the opera- 
tional flowchart shown in FIGS. 5 to 7. FIG. 8A shows an 
explanatory view of the data structure of the basic block; 
FIG. SB shows an explanatory view of the dummy block; 1 
and FIGS. 9A. 9B. 9C and 9D show a view of an example / 
of the connection matrix. 

Information collecting phase processing unit 402 exe 
the process of steps 501 to 505 designated by an operational 



25 



program is not changed based on the frequency of the I flowchart shown in FBGLaV N 

executions. In order to perform the optimization, analysis 15 / program, which ^converted to an intermediate code 

the flow of control of the program and the flow of the data / by structure analysis an3seSStksTn5ay sis in a compilation 

is necessary. Hie higher ore level of optimization planned. process, is arra ng ed In units of a basic block (w g eh is 

me more precise an analysis is required, renrtuBin a longer ^^JTa Mock h etem^^ithTe gard to respective 

^required for the c«m P ilat,on process. However the p^rh I ^^^^ mw Mofmation concerning a preceding 

with a low frequency of execution is not executed , . . . . \~ ~ 

frequently, as a rLlt even if the c*timization level is low T ^J^^L? 8 

IZT^t^^^Tn^^^l / FIG. 8A shows the data structure of a block (basic block), 
total execution time of the program. In some casesjtoten- lA^ block has the data structure ^rr^rising a pointer to a list 

designating a parent block, a pointer to a list designating a 
child block* a pointer designating an instruction at the head 
of the block, a pointer designating an instruction at the end 
of the block and an attribute area. Control flow information 
relating to the preceding block and the succeeding block of 
^ Ithe particular block obtained In step 591 is allocated respec- 
tively as a pointer to a list designating the parent block and 
a pointer to a list designating the child block in the structure 
defining the particular block is shown in FIG. 8A. 

Id step 502 in FIG. 5. a distinguishing mark is allocated 
at a block in which the branch condition in the complex IF 
statement comprising the branch probability designation is 
included. The distinguishing mark distinguishes the com- 
plex IF statement, including the branch condition of the 
block at which the distinguishing mark is allocated and 
actually comprises a unique address or flag. 

In step 513, a THEN attribute is allocated at a block 
representing a THEM clause (which is called a THEN block 
hereinafter) and an ELSE attribute is allocated to a block 
representing an ELSE clause (hereinafter called an ELSE 
block hereinafter). 

A distinguishing mark* a THEN attribute or an ELSE 
attribute for a particular block allocated in steps 502 and 503 
is allocated to an attribute area of the structure which defines 
the particular block shown in FIG, 8A. 

Instead of allocating the attribute area in the structure of 
the block as shown in FIG. &A. a branch condition of a 
complex IF statement and a dummy instruction representing 
a THEN clause or an ELSE clause (which is represented as 
dummy-true or dummy-false in FIG. 8B) may be inserted as 
the block as shown in FIG. SB. 

In step 504. a truth ratio of the branch probability of the 
complex IF statement designated by the optimization des- 
ignating statement (statement 1 in FIG. 2. for example) is 
obtained. 

In step 505. a structure information table is prepared for 
the complex IF statement The structure information table 
includes the following information; 
Designated truth ratio 

Pointers to structures of blocks including branch condi- 
tions and the number of pointers 
A pointer to the structure of a THEN block 
A pointer to the structure of an ELSE block 



ing the compilation time of the program is more 
than shortening the execution time of the program. 
Accordingly, the time required for the optimization process, 
which may be time-consuining. may be shortened by 
decreasing the cptirnization level of the path along with a 
low number of executions based on the allocated branch 
probability. 

FIGS. 3A. 3B and 3C show explanatory view of another 
principle of the present invention. 

The above explanation is applied to the case where the 
ELSE clause exists in the complex IF. respectively state- 
ment As shown in a program 301 in FIG. 3A, the same 
explanation made above may be applied where the ELSE 
clause does not exist In this case, the probability at which 
the node zz is executed is actually 100 percent However the 
branch probability is calculated under the assumption that 
node zz is executed at a probability of 60 percent when the 
branch probability is calculated based on the principle 
recited above. As a result as shown in a flow graph 302 in 
FIG. 3B. the numbers of executions travelling through 
respective paths are calculated and the branch probabilities 
of respective branch conditions are calculated. 

In the example of the program 201 shown in FIG. 2. the 
branch conditions D and E are deleted and the control flow 
is changed when branch condition D is always true in the 
compilation process subject to the optimization. In this case 
shown in FIG. 3B . the control flow is examined immediately 
before the calculation of the branch probabilities. The the 
branch probabilities of respective branch conditions can be 
allocated correctly as shown in flow graph 303 in FIG. 3C. 
Explanation of Preferred Embodiments 

The preferred embodiments of the present invention based 
on the above described operational principle is now 
explained. 

FIG. 4 shows a structure diagram of a preferred embodi- 
ment of the present invention. 

A processing apparatus 401. allocated with a CPU and 
memory, executes a compilation process. 

An information collecting phase processing unit 402 
executes a phase for collecting compilation information to 
create a flow graph. 

A connection matrix creating phase processing unit 403 
executes a phase for creating a connection matrix represent- 
ing a flow graph. 
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The above described process in the information collecting Accordingly. the connection matrix calculating phase 

phase processing unit 402 corresponds to process 1. processing unit 404 executes the process, designated by the 

whereby the process is executed as part of the flow graph operational flowchart shown in FIG. 7. 

creating process 202 explained as the principle of the present In step 711. respective numbers of paths inputted to the 

inveZrVb7^erringtona2. 5 THEN block and the ELSE block are detected by referring 

Sequentially, the connection matrix creating phase pro- to the structure information table and the structure^ of the 

cessing unit 403 executes the processes shown in the opera- block designated by the structure information table. The 

tional flowcharts in FIGS. 6A and *B. number of paths inputted to the THEN Mock is expressed as 

As shown in step 601 in FIG. 6A, the information then_path_cnt and the number of paths inputted to the else 

collecting phase processing unit 402 refers to the structure 10 block is expressed as else_path_cnt. This process corre- 

information table prepared in step 505 of FIG. 5. Step 602 spends to the process 4 which is executed as part of the 

creates a connection matrix having rows corresponding to n branch probability detennining/allocating process 203 in the 

blocks, each including a branch condition, a THEN block explanation of the principle based on FIG. 2. 

and an ELSE block, and having columns cccresponding to Next, in step 70Z the number of executions to be calcu- 

the n blocks, each including a branch condition with regard 15 lated by the following equation is allocated in the array 

to each complex IF statement. For example, the connection element MATRIX (n+1, i)(i=l □) where the IP attribute 

matrix created from program 201 shown in FIG. 2 is is set 

reflected in FIG. 9A ^ . 

A block corresponding to a row in the connection matrix ^ ***** * «ecuto*4h* tnnh 

is called a row block and the block corresponding to a 20 j s ^ m the structure information table and is designated for 

column is called a column block, and the portion where the ^ complex jp statement which is the subject of the 

row and the column intersect is called an element process, in accordance with an optimization designating 

When the connection matrix is created in the step 602, as statement 

an initial state of the connection matrix, the respective Sequentially, in step 703. the number of executions to be 

elements of the connection matrix have been allocated with 25 calculated by the following equation is allocated in the array 

an IP attribute, IS attribute, or DISCONNECT attribute. The clcmcnt MATRIX (n+2. i) (i=l n) in which the IP 

IP attribute designates that the column block corresponding attribute is set 
to the element precedes the row block in the control flow the 

IS attribute designates that the column block corresponding The number of e ic c m ioo s-< to(Mhe truth Mfc>yebe_j*ih_ait. 

to the element succeeds the row block in the control flow. 30 ^ . . JJ1J)L1U , f . 

and the DISCONNECT atnibut e designates that the column ^f^S^SSt £ SSElS 

Nock co^sponding to tie clement * not ^ * the row fteotpla- 

S^EEKS^* SZrf toe principle Led c ria ^As a result, the array 

MSCONNECT attribute >s represented by mark X in FIGS. ^ ^TUX (JTn) corresponding to the connection matri* 

^^^^^^^!^^Z ^uSy. * ^l^tilowing step, 704-1 to 
matnxcreanng process shown in step 602 is represented in ^ ^^uted to the column in which attribute . 

HG.«B. does not exist and the IS attribute exists, namely, for a block 

V— ^nlT^i? «femwhicht^branch ^^wlrich the number of executions of all the paths outputted 
condxuon ^^^^^ _ . . ^ 40 ta me Wo ck to the other 

In step 604, the array MATRIX (n+2, n), where Ae the sum of the mimbers of respective 

number of the row element is n+2 and the number of the in aqj _7. , i .-a *~ Jwm «tc in JhM th» 

, : . . m ■ matdtv executions sum__cnt is calculated for elements in which me 

column element is n, is generated. This array MATRIX CM ~™ c ~- • _ _ , fUm ^,„„ Vnr 

. * numbers of the executions are stored, along the column, ror 

*T£ the elements of the array «. to the column E shown in FIG. 9B. the 

MAI1UX is allocated as ^DISCONN^T attribute at the "^"^^^^^T^enumber of dements which have the IS 

iruualstate of prepanng fte ar^ M^TWX .nnStong *e column is calculated as c!cment_cnt. For 

Accordingly, the value of index i is incremented from 1 to , . A . . c -t™™ j- inr 1 on *h» 

n+2 bjTKp. 606. 612 and 613 and the value of index with regard to the column E shown in FIG. 9B. the 

j is incremented from 1 to n by 1 with 30 ^TtglZ* the value of element (the number of 

1 "TlXLZZLSi KS^* «e«Zs) which have the IS attribute along the column is 

"S^SX KSnSwie^Ld* Hoc* caicuUted Lrf on the following e^tioo and is aHocated 

of i-th block is the j-th block. If the detennination is YES, for mc dement 

in step 609, the IP attribute is allocated at the array element 55 T^mmber <rf executk*^^ 

MATRIX (i, j) and the IS attribute is allocated at the array 

element MATRIX (j. i). For example, in FIG. 9B, the number of executions 40 is 

When the processes for all the array elements are allocated for the element (DM). 
completed, the connection matrix creating phase processing In step 704-4, the number of executions calculated in step 
unit 403 has completed its operation. As a result, an array 60 704-3 are allocated for a diagonal element of the element in 
MATRIX (n+2, n) cccresponding to the connection matrix which the process of step 704-3 is executed. For example, in 
shown in FIG. 9A is obtained. FIG. 9B, the number of executions 40 which is equal to the 

The process performed by the connection matrix creating number of executions allocated for the element (D. E) is 
phase processing unit 403 corresponds to the process 2, allocated for the diagonal element (E, D) of the element (D, 
executing as part of the flow graph creating process 202 in 65 E). The diagonal dement always has the IP attribute, 
the explanation of the principle of the present invention by The process of the above described step 704 corresponds 
referring to FIG. 2. to the process 6 which is executed as a part of the branch 
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probability detenmnmg/aUocating process 203 in the expla- 
nation of the principle by referring to FIG. 2. 

In step 705, the process of the step 704 is repeated until 
all the elements having the IP attribute or IS attribute do not 
exist As a result the array MATRIX (n+2. n) corresponding 
to the connection matrix shown in FIG. 9C is obtained. This 
process corresponds to the process 7 which is executed as a 
part of the branch probability detennining/allocating process 
205 in the explanation of the principle by referring to FIG. 
Z 

In step 706, only the value of the array element in which 
the IP attribute is originally allocated is left and the value of 
the array element at which the IS attribute is originally 
allocated is deleted. As a result, array MATRIX (n+2, n) 
corresponding to the connection matrix shown in FIG. 9D, 
is obtained. The connection matrix corresponds to the flow 
graph 2046 shown in FIG. 2. 

Finally, in step 707, the value obtained by dividing 
respective values of the elements by the sum of the values 
of the elements along the column is allocated for the 
respective elements of each of the columns. The value of 
each element snows the branch probability of the path from 
the block corresponding to the column in which the element 
is included to the block corresponding to the row in which 
the element is included. This process corresponds to the 
process 8 which is executed as part of the branch probability 
deiermiiiing/allocating process 203 in the explanation of the 
principle of the present invention by referring to FIG. 2. 

The above described connection matrix calculation phase 
processing unit 404 enables the branch probabilities to be 
determined for the respective branch conditions in the 
complex IF statement including a plurality of branch con- 
ditions and also enables the precise optimization scheduling 
in the compilation process to be performed. 
An Explanation of Other Embodiment 

In the above embodiment as shown in the program 201 
of FIG. 2 for example, the values corresponding to the 
numbers of executions of respective paths arc calculated and 
the branch probabilities of the respective branch conditions 
are assigned to the nodes corresponding to the respective 
branch conditions so that the whole branch probability (truth 
ratio=40 percent) of the complex IF statement of the state- 
ment 2 designated by the optimization designating statement 
of the statement 1 is satisfied. 

In contrast in FIG. 10, the example of the program is 
shown where the respective truth ratios are determined for 
the respective branch conditions of the complex IF state- 
ment 

In this case, after the complex IF statement is resolved 
into respective branches and marks are allocated to the 
respective branches, the respective truth ratios designated 
are allocated for respective branches. 

In the program 1001 shown in FIG. 10, OA is allocated as 
the truth ratio of the branch condition A, 0^ as the truth ratio 
of the branch condition B. 0.9 as the truth ratio of the branch 
condition C. 0.8 as the truth ratio of the branch condition D 
and 0.4 as the truth ratio of the branch condition E. 

What is claimed is: 

L A compilation processing method for performing a 
translation process for a program including at least one 
branch statement, using initial branch probability mforma- 
tion for the at least one branch statement the at least one 
branch statement having a plurality of branch conditions, 
said compilation processing method comprising the steps of: 
providing nodes, with regard to a branch statement, each 
node being one of a branch node representative of a 
corresponding branch condition In the branch 
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statement, a first basic block representative of code to 
be executed when a decision of the branch statement is 
true, and a second basic block representative of code to 
be executed when the decision of the branch statement 
5 is false; 

creating a flow graph by detennining a path from each 
node to each corresponding node, each path being input 
to one of the nodes and output from another one of the 
nodes; and 

10 detennining final branch probability information for each 
path from each branch condition in the branch state- 
ment based on the flow graph, each path having cor- 
responding branch probability information, said corre- 
sponding branch probability information indicative of a 

15 frequency that a specified path is executed such that the 
branch probability information for the plurality of 
branch conditions in the branch statement initially 
corresponds to the initial branch probability informa- 
tion for the branch statement 

20 2. The compilation processing method according to claim 
1, further comprising: 

decreasing an optimization level of the path correspond- 
ing to a selected branch condition of the plurality of 
branch conditions, said selected branch condition being 

25 estimated as having a low execution frequency based 
on the determined branch probability information 
determined for said corresponding path. 

3. A compilation processing method for performing a 
translation process for a program including at least one 

30 branch statement using initial branch probability informa- 
tion for the at least one branch statement the at least one 
branch statement having branch conditions, said compilation 
processing method comprising the steps of: 
35 providing nodes, with regard to a branch statement, each 
node being one of a branch node representative of a 
corresponding branch condition in the branch 
statement a first basic block representative of code to 
be executed when a decision of the branch statement is 
^ true, and a second basic block representative of code to 
be executed when the decision of the branch statement 
is false; 

creating a flow graph by determining paths from each 
node to each corresponding node, each path being input 
4S to one of the nodes and output from another one of the 
nodes; and 

rflirubiring final branch probabilities far the nodes, the 
final branch probability being calculated for each node 
based on a number of paths incident to the node in the 

50 flow graph, the initial branch probability of each path 
incident to a selected and corresponding node, each 
node having a corresponding branch probability for the 
selected and corresponding node, such that the branch 
probabilities of the incident paths of the nodes of said 

55 branch statement correlate to designated branch prob- 
ability information for the branch statement 

4. The compilation processing method according to claim 
3. further comprising: 

decreasing an optimization level of a path corresponding 
60 to a selected branch condition of the branch conditions, 
said selected branch condition being estimated as hav- 
ing a low execution frequency based on the calculated 
branch probability information. 

5. A compilation processing method for performing a 
65 translation process for a program including at least one 

branch statement, using initial branch probability informa- 
tion for the at least one branch statement the at least one 
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branch statement having branch conditions, said compilation 

processing method comprising the steps of: 
providing nodes, with regard to a branch statement each 
node being one of a branch node representative of a 
corresponding branch condition in the branch 
statement, a truth condition block representative of 
code to be executed when a decision of the branch 
statement is true, and a false condition block represen- 
tative of code to be executed when the decision of the 
branch statement is false; 
creating a flow graph by determining a path from each 
node to each corresponding node, each path being input 
to one of the nodes and output from another one of the 
nodes; and 

allocating respective values corresponding to numbers of 
executions for the node corresponding to said truth 
condition block and for the node corresponding to said 
false condition block, respectively, each of the respec- 
tive values calculated by a ratio based on the initial 
branch probability information designated for the 
branch statement; 

dividing, for each respective node, the value correspond- 
ing to the number of executions allocated for the 
respective node by the respective number of paths input 
to the respective node and producing a result; 

allocating the results of the respective division as the 
respective values corresponding to the numbers of 
executions of the respective paths input to the respec- 
tive nodes, for the node which is the truth condition 
block and the node which is the false condition block; 

determining a sum of the values corresponding to the 
numbers of executions for all of the paths incident to 
the respective node and allocating the sum as a corre- 
sponding value for the number of executions of the 
respective node; 

dividing the respective values corresponding to the num- 
bers of executions allocated for the respective node by 
the number of paths input to the respective node, and 
allocating a result of the division as the respective 
values corresponding to the numbers of executions of 
respective paths input to the node, for each of the nodes 
of the branch statement; and 

calculating a final branch probability of each respective 
paths of the flow graph as the final branch prcoability 43 
of the branch condition corresponding to the node from 
which the path is output, by dividing the respective 
values corresponding to the number of executions 
allocated for the respective paths by the corresponding 
value for the number of executions allocated for the 
respective node from which each respective path is 
output 

6. The compilation processing method according to daim 
5, further comprising: 

decreasing an optimization level of the path correspond- 
ing to a selected branch condition of the branch con- 
ditions estimated as having a low execution frequency 
based on the detennined branch probability informa- 
tion for said corresponding path. 

7. A compilation processing apparatus for performing a 
translation process for a program including at least one 
branch statement, using initial branch probability informa- 
tion for the at least one branch statement, the at least one 
branch statement defining conditional decisions, said com- 
pilation processing apparatus comprising: 

information collecting phase processing means for 
extracting nodes, each node being one of a branch node 



representing respective branch conditions in a branch 
statement, a first basic block executed when a corre- 
sponding conditional decision in the branch statement 
is true and a second basic block executed when the 
corresponding conditional decision in the branch state- 
ment is false, each node executed in a determined order, 
for determining a path from each branch node to each 
corresponding node, and for collecting information 
concerning a preceding node and a succeeding node; 
block connection relation data forming means for 
forming, based on the information collected by said 
information collecting phase processing means, block 
connection relation data having a plurality of elements, 
each element indicative of a corresponding connection 
relation between respective nodes; and 
block connection relation data calculating means for 
rflfnitftrin^ based on the block connection relation 
data, a final branch probability for each of the paths 
corresponding to the respective elements, as final 
branch probabilities of the branch conditions included 
by a corresponding branch node from which the respec- 
tive paths are output such mat the final branch prob- 
abilities correspond to designated branch probability 
information for the branch statement 
^ 8. A compilation processing apparatus for performing a 
translation process for a program upon execution of the 
program, using initial branch probability information for at 
least one branch statement defining conditional decisions in 
the program, said compilation processing apparatus com- 
30 prising: 

information collecting phase processing means for 
extracting program code blocks, each program code 
block being one of a branch block including respective 
branch conditions in a branch statement, a first basic 
block executed when a corresponding conditional deci- 
sion in the branch statement is true and a second basic 
block executed when the corresponding conditional 
decision in the branch statement is false, the program 
code blocks executed in a determined order, for deter- 
mining paths, a path determined from each branch 
block to each corresponding program code block, and 
for collecting information concerning a preceding pro- 
gram code block of respective program code blocks and 
information concerning a succeeding block of the 
respective program code blocks; 
block connection relation data forming means for 
forming, based on the information collected by said 
information collecting phase processing means, block 
connection relation data having a plurality of elements, 
each element indicative of a corresponding connection 
relation between respective program code blocks; and 
block connection relation data calculating means for 
calculating, based on the block connection relation data 
and the paths between the respective program code 
blocks, a branch probability for each of the paths 
corresponding to each element* as final branch prob- 
abilities of the branch conditions included by the 
respective program code blocks from which the paths 
are output such that the branch probabilities for the 
plurality of paths correspond to designated branch 
probability information far the branch statement. 
9. A compilation processing apparatus for performing a 
translation process for a program* the program including at 
least one branch statement and branch probability informa- 
65 tion for the at least one branch statement, the at least one 
branch statement defining conditional decisions, said com- 
pilation processing apparatus comprise: 
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information collecting phase processing means for branch statement using initial branch probability informa- 

extracting basic blocks, each basic block being one of tion for the at least one branch statement the at least one 

a first basic block including respective branch condi- branch statement defining a plurality of conditional 

tions in a branch statement, a second basic block decisions, said compilation processing method comprising 

executed when decision of the branch statement is true 5 the steps of: 

and a third basic block executed when decision of the creating a flow graph by resolving the plurality of con- 
branch statement is false, each said basic block ditfonal decisions defined by the at least one branch 
executed in a detemiined order, for determining a path statement into respective conditional decisions; and 
from each basic block to each corresponding basic / r . 
block, and for collecting concerning a preceding basic 10 a^erinining branch probability i^^rnation for each 
block of the respective first second and third basic respective conditional decision in the flow graph based 
blocks and information concerning a succeeding basic on the flow graph and the initial branch probability 
block of the respective first second and third basic Mormation to to obtain final branch probability infor- 
blocks* mation for the at least one branch statement. 

block connection relation data forming means for " U A compilation processing method for performing a 
forming, based on the information collected by said translation process for a program including at least one 
information collecting phase processing means, block branch statement using initial branch probability Morma- 
connection relation data having a plurality of elements. tion for the branch statement the branch statement defining 
each element indicative of a corresponding connection a plurality of conditional decisions, said compilation pro- 
relation between respective basic blocks; 20 cessing method comprisiiig the steps of: 

first block connection calculating means for allocating, in creating a flow graph by resolving the at least one branch 

the block connection relation data, a first value corre- statement into respective conditional decisions; and 

sponding to a first number of executions, by dividing a allocating branch probability for each respective condi- 

second value corresponding to a second number of ^ tional decision in the flow graph, based on the flow 

executions allocated for the truth condition block, graph and the initial branch probabuity information to 

calculated b*sedc* a b^ to Obtain final branch probability information of of the 

the branch statement, by the number of elements car- dhstatemenL 

responding to all the paths input to the truth condition « J*!?? slaI ™™ . n . 

irajwiwmjwau *~ *~ \2. The compilation processing method according to 

block, each element designating one of the paths which iuc u^iwu 6 

is input to the truth condition block; 30 claim 11 wherein: 

second block connection calculating means for allocating, said creating of a flow graph further determines paths for 

in the block connection relation data, a third value conditional decisions; and 

corresponding to a third number of executions, by said aUocating of branch probability allocates branch 

dividing a fourth value corresponding to a fourth num- 35 probabilities for each path; and 

ber of executions allocated for the false condition further comprising the steps of decreasing an c^rtirnization 

block, based on the branch probability designated for level of a selected path of the paths which is determined 

the branch statement by the number of elements cor- to ^ a ^ execution frequency based on a respec- 

responding to all the paths input to the false condition ^ for ^ ^ 

block, each element desi^g one of the paths which ^ 13. a compilation processing apparatus for performing a 

is input to mc false condition block; translation pocess for a program including at least one 

third block connection calculating means for allocating, in statement Qsing initial branch probability informa- 

the block connection relation data, a nfm value corre- ^ fa ^ bmocfa statcmenU the branch statement defining 

spondmg to a fifth number of executions by dmdmg ,a conditional decisions, said compilation pro- 

sum of sixth values rf 45 Zp*ntos coinprising: 

the executions allocated for each of the respective «»""B «w«i«u ^ * 

elements coraponding to all the paths output from a urfc^on coOectu^ 

selected cwTlLic blockst^ number of * *g the at one branch ^statcmeiit jinto respective 

elements representing the paths input to the selected coiidmonal decisions all 

basic blc*lTto 50 ^ 3 "T^^J^ 

to the selected basic block, for each of the basic blocks collecting information for creating a flow graph; 

of the branch statement; and connection matrix creating phase processing means for 

fourth block connection calculating means for calculating, creating a connection matrix representing the flow 

a branch probabiUty for each respective element of the &*P* designating connection ^ons relating to 

block connection relation data, as the branch pfobabil- 55 branch nodes and succeeding basic blocks, based on the 

try of the corresponding branch condition included by collected information; and 

the basic block from which the path corresponding to connection matrix ca lc ulati ng phase processing means for 

each element is output by dividing a seventh value calculating a ratio of a number of executions in accor- 

ccnesponding to a seventh number of executions alio- dance with the number of paths from the succeeding 

cated for each element by a sum of eighth values €0 basic block to one of the branch nodes, based on the 

corrcsponding to eighth numbers of executions alio- initial branch probability information by referring to 

cated for the elements corresponding to all of the paths the created connection matrix, and allocating respective 

output from the basic block from which the paths branch probability information for each of the respec- 

corresponding to the respective element are output tive conditional decisions, 
10. A compilation processing method for performing a 65 

translation process for a program including at least one * * * * * 
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